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METHOD AND APPARATUS FOR TRANSPORTING ENCRYPTED 
MEDIA STREAMS OVER A WIDE AREA NETWORK 

10 

BACKGROUND 

Many mechanisms exist to encrypt voice over a Public Switched Telephone Network 
(PSTN). For example, customers can deploy devices in front of Private Branch Exchanges 
(PBXs) that encrypt calls made over the PSTN. These devices detect the presence of a 
15 similar encryption device on a far end of the call and then establish an encrypted call with the 
far end device. These encryption mechanisms are used for example by the military and 
defense contractors. 

Secure Real-time Transport Protocol (SRTP) is a standard used for encrypting real- 
time media, such as voice or video, in an Internet Protocol (IP) network. The SRTP 
20 encryption scheme is used to protect voice or video sent over the IP network. Other legacy 
encryption equipment is then used to protect the same voice or video stream when transported 
over the PSTN. 

For example, a Voice Over IP (VoIP) call is encrypted into SRTP packets and then 

transported over an IP network. If the VoIP call needs to travel over a PSTN network, the 
25 media in SRTP packets is decrypted and decoded. The decrypted and decoded media is then 

re-encrypted by a PSTN encryption device prior to being transported over the PSTN. On the 

opposite end of the PSTN connection, the PSTN encrypted media is decrypted and decoded. 

The decrypted media is then re-encoded and re-encrypted back into SRTP packets for 

transport over the IP network to a final destination endpoint. 
30 This multi-stage encryption process breaks end-to-end encryption and requires 

additional jitter buffers and codecs that add significant delay and adversely affect voice 

quality. 

SUMMARY OF THE INVENTION 
35 A network processing device identifies call requests that require secure media 

connections and that also require transport over both a packet switched network and a circuit 
switched network. The network processing device establishes an IP link over the circuit 
switched network and directs endpoints for the media connection to use Internet Protocol (IP) 
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media encryption. The same IP encrypted media is then transported end-to-end over both the 
packet switched network and the IP link in the circuit switched network. 

The foregoing and other objects, features and advantages of the invention will become 
more readily apparent from the following detailed description of a preferred embodiment of 
the invention which proceeds with reference to the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a diagram of a Wide Area Network (WAN) that uses an End-to-End Secure 

Real-Time Transport Protocol (EE-SRTP). 

FIG. 2 is a block diagram showing how EE-SRTP operates in a gateway. 

FIG. 3 is a diagram showing how SRTP packets are transported over an IP link in a 

PSTN network. 

FIG. 4 is a more detailed diagram of a gateway that supports EE-SRTP. 

DETAILED DESCRIPTION 

Referring to FIG. 1, a Wide Area Network (WAN) 13 includes an IP network 34, 35 
and a Public Switched Telephone Network (PSTN) 24. An End-to-End Secure Real-time 
Transport Protocol (EE-SRTP) 22 is used to transport SRTP packets 20 over both the IP 
networks 34 and 35 and over the PSTN network 24. Media is encrypted at a call endpoint 
and then transported to a destination endpoint without having to decrypt and then re-encrypt 
the media when it reaches the edge of the PSTN network 24. The terms PSTN and circuit 
switched network are used interchangeably in the description below. The PSTN and circuit 
switched networks are distinguished from packet switched networks that use an Internet 
Protocol (IP) for transferring data over a network. 

The WAN 13 includes a phone 14 and a computer 15 that are connected to a PSTN 
gateway 16 through IP network 34. The PSTN gateway 16 is connected through a PSTN 
network 24 to another PSTN gateway 26. A phone 32 and computer 30 are connected to 
PSTN gateway 26 through another IP network 35. 

The computers 15 and 30 can be any laptop, Personal Computer (PC), server, etc. 
capable of sending or receiving a media stream. The media stream can include any real time 
media such as audio data, video data, and real time text such as the text used for the hearing 
impaired. The phones 14 and 32 shown in FIG. 1 are Voice Over Internet Protocol (VoIP) 
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5 phones that convert voice signals into IP packets and correspondingly convert voice data 
contained in received IP packets into voice signals. 

The PSTN gateways 16 and 26 are used to convert media between the IP format used 
in the IP networks 34, 35 and the analog or digital formats used in the PSTN network 24. 
The gateways 16 and 28 can also manage the signaling required for establishing the media 
10 calls over IP networks 34, 35, and the PSTN network 24. The EE-SRTP 22 is configured into 
gateways 16 and 22 and enables both gateways to transport SRTP encrypted packets 20 over 
the PSTN network 24 without having to decrypt and re-encrypt media at intermediary hops in 
the call. 

Referring to FIGS. 1 and 2, any combination of the phones 14 and 32, computers 15 

15 and 30, and gateways 16 and 26 are referred to generally below as endpoints, users, senders 
or receivers, etc. One of the endpoints initiates a media call to another endpoint. For 
example, the phone 14 may make a VoIP phone call to phone 32. A caller dials the phone 
number for phone 32 using phone 14. The gateway 16 then determines the IP address for the 
gateway 26 serving phone 32. The two gateways 16 and 30 then conduct a signaling session 

20 that establishes a connection between phone 14 and phone 32. 

In another example, the computer 15 may request a video stream and/or audio stream 
from computer 30 which operates as a music and video content server. In another scenario, 
the computer 15 may wish to make a VoIP call to VoIP phone 32 or to computer 30. The 
computer 15 may establish the connection through gateway 16 or may access the gateway 26 

25 or computer 30 directly through the PSTN network 24, without using gateway 16. 

In the example described below, the phone 14 initiates a VoIP phone call to phone 32. 
Because phone 14 is initiating the call to phone 32, the PSTN gateway 16 will be 
alternatively referred to as the egress gateway 16. The PSTN gateway 26 associated with the 
target phone 32 is referred to as the ingress gateway 26. 

30 The two PSTN gateways 16 and 28 are preconfigured, out of band, with a shared 

secret (key 36) and are also configured with a dial plan 38 of their peers. The dial plan 38, 
among other things, associates phone numbers with IP addresses. The dial plan 38 can also 
identify certain destination phone numbers that require a connection through the PSTN 
network 24. The dial plan 38 can also identify gateways, such as gateway 26, that are 

35 configured with the EE-SRTP 22. 



PATENT APPLICATION 
Attorney Docket No. 2705-320 
Seq. No. 8346 



3 



5 A user (not shown) enters the phone number of phone 32 into phone 14. The call 

request is received by egress gateway 16 in block 50 (FIG.2). The egress gateway 16 
determines from the originating phone number of phone 14 that the call should use SRTP. 
For example, the phone 14 may be associated with a military contractor or some other high 
security communication. In block 54, the egress gateway 16 tells the phone 14 to use only 

10 SRTP for the call. Accordingly, the phone 14 encrypts the media and other information 
contained in Real-time Transport Protocol (RTP) and Real-time Control Protocol (RTCP) 
packets using SRTP or some other type of encryption scheme. 

In block 56, the egress gateway 16 determines whether or not a connection to the 
destination phone number for phone 32 requires access over the PSTN network 24. If not, 

15 the SRTP packets 20 are forwarded in a conventional manner over the IP network 34 in block 
62. The egress gateway 16 determines in block 58 whether or not the ingress gateway 26 
supporting phone 32 is configured with EE-SRTP 22. This information may be located in 
dial plan 38 in general memory or may be located at a call center (not shown) that is accessed 
by egress gateway 16. 

20 If the ingress gateway 26 is configured with EE-SRTP 22 in block 58, an IP 

connection is established over the PSTN 24 in block 60. The gateways 16 and 26 
authenticate each other as described in further detail below in FIG. 3 and then exchange 
^ SRTP encryption keys 18 and 28. The phone 14 sends the SRTP key 18 to the egress 
gateway 16 which then forwards the SRTP key 18 over the IP link 25 to phone 32. The 

25 phone 32 sends a second SRTP key 28 to the gateway 26 which then forwards the second 
SRTP key 28 over the IP link 25 to phone 14. 

Phone 14 then encrypts voice data into SRTP packets using SRTP key 18 and phone 
32 to encrypt voice data using SRTP keys 28. The encrypted SRTP packets 20 from phone 
14 are received by the egress gateway 16 in block 64 and forwarded in the same encrypted 

30 SRTP format over the IP link 25 in PSTN 24. The ingress gateway 26 forwards the SRTP 
packets to the phone 32. Phone 32 then decrypts the packets 20 using SRTP key 18. 

A similar process is used for SRTP packets sent by phone 32 to phone 14. The 
gateway 26 forwards the SRTP packets 20 from phone 32 over the IP connection 25 to 
gateway 16. Gateway 26 does not decrypt the SRTP packets 20. The gateway 16 receives 

35 the SRTP packets 20 from gateway 26 in block 68 and forwards the encrypted SRTP packets 
to phone 14 in block 70. 
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5 If the ingress gateway 26 is not configured with EE-SRTP in block 58, the egress 

gateway 16 processes SRTP packets for the call in a conventional manner in blocks 72-84. 
The SRTP packets are received from phone 14 in block 72 and the SRTP packets are 
decrypted in block 74. The decrypted media is converted into a PSTN format in block 76. 
For example, an RTP payload in the SRTP packets may be converted into a Time Division 

10 Multiplexed (TDM) format for transmitting over the PSTN network 24. The media may then 
be re-encrypted using a conventional PSTN encryption device in block 77 and forwarded 
over the PSTN network 24 to the ingress gateway 26 in block 78. The ingress gateway 26 
has to then decrypt and decode the PSTN call and then re-encode and re-encrypt the call into 
SRTP packets. The ingress gateway 26 then forwards the SRTP packets to phone 32. 

15 Similarly in the opposite direction, the gateway 16 in block 80 receives PSTN 

encrypted data over the PSTN 24 from gateway 26. The gateway 16 decrypts and decodes 
the PSTN media in block 82 then re-encodes and re-encrypts the media into SRTP packets. 
The SRTP packets are then forwarded to the phone 14 in block 84. 

FIG. 3 shows in more detail how the IP connection 25 is established between the two 

20 gateways 16 and 26. It is still assumed that the call is initiated by phone 14 to phone 32 (FIG. 
1). In operation 1, the egress PSTN gateway 16 receives a call request from phone 14. In 
operation 2, the egress PSTN gateway 16 examines the dial plan 38 to determine if the call 
requires SRTP 22. One particular technique for using the dial plan 38 is described in U.S. 
Patent No. 6,584,093 which is herein incorporated by reference. 

25 For example, the dial plan in egress gateway 16 associates SRTP with the originating 

phone number for phone 14. Whenever the phone 14 initiates a call, egress gateway 16 
compares the source phone number in the call request with the SRTP entries in the dial plan 
38. If a match exists, then egress gateway 16 directs the phone 14 to encrypt all media using 
SRTP. The dial plan 38 may also or alternatively link certain destination phone numbers 

30 with SRTP encryption. When the destination phone number in the call request matches an 
SRTP number in the dial plan 38, the egress gateway 16 directs the phone 14 to encrypt all 
media packets using SRTP. 

The egress gateway 16 conducts normal signaling over the PSTN 24 for the call to 
phone 32, except the call is established as an ISDN call (Clear Channel Data) in operation 3. 

35 This means no Digital Signal Processors (DSPs) are consumed on the egress gateway 16 or 
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5 ingress gateway 26. This also means no modem protocols are used between the egress 
gateway 16 and ingress gateway 26. 

After signaling the PSTN 24 and establishing connectivity with the ingress gateway 
26, the egress gateway 1 6 and the ingress gateway 26 perform mutual authentication using 
Challenge Handshake Authentication Protocol (CHAP) in operation 4. The shared secret 36, 

10 previously configured into the two gateways 16 and 28, is used for this authentication. After 
mutual authentication, a Point-to-Point (PPP) session is established over the PSTN 24 in 
operation 5. This PPP session runs IP unnumbered interfaces, but does _NOT_ IP route 
packets from the two IP networks. Rather, the IP and UDP headers are only provided to 
allow running over an unmodified implementation of the stack. 

1 5 After the PPP session is established, in operation 6 the egress gateway 1 6 takes the 

SRTP key 18 which it receives from the phone 14, encrypts the SRTP key 18 using the 
shared secret 36, and sends the encrypted SRTP key 18 to the ingress gateway 26. The 
ingress gateway 26 does a similar operation 7 by encrypting the SRTP key 28 from phone 32 
and sending the encrypted key to the egress gateway 16. This exchange is done using a new, 

20 proprietary PPP protocol-id. 

Now both gateways 16 and 28 know the two SRTP keys 1 8 and 26 and can perform 
call setup with their respective VoIP networks. Gateway 16 decrypts the encrypted key 28 
and sends the decrypted key SRTP key 28 to phone 14. Gateway 26 decrypts SRTP key 18 
and sends the decrypted key to phone 32. 

25 The gateway 16 receives SRTP packets from phone 14 in operation 8. The gateway 

16 removes the IP header and UDP header from the received SRTP packets in operation 9 
and creates a new IP and UDP header in operation 10. The gateway 16 appends the new IP 
and UDP headers to the SRTP packet in operation 1 1 and sends the SRTP packet over the 
PPP ISDN link in operation 12. The RTP header in the SRTP packet is preserved because 

30 SRTP authentication is done over the entire RTP header. Transmission of these SRTP 
packets is done using a new, proprietary PPP protocol-id. 

Thus, the EE-SRTP sends SRTP-protected VoIP traffic over the PSTN by securely 
exchanging the SRTP keys and sending SRTP over PPP. This provides full end-to-end 
encryption of the media stream, without decrypting the stream in intermediate devices. The 

35 full end-to-end encryption is achieved without additional latency due to the encryption as 

would typically be the case if the SRTP packets had to be decrypted in order to encode into a 
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5 Pulse Code Modulation (PCM) format and then re-encrypted using legacy encryption 
equipment. 

FIG. 4 shows in more detail the circuitry used for supporting EE-SRTP. The circuitry 
in one example is located in a network PSTN gateway but could alternatively be located in 
any network processing device used for transmitting IP media packets over different network 
10 infrastructures. 

An IP network interface 92 connects to an IP network 90 and a PSTN interface 108 
connects to a PSTN network 110. One or more processors 1 12 include different software 
programs for performing the different operations used by the EE-SRTP 22. Packet 
formatting software 94 is used to replace the IP and UDP headers for the media packets. Call 

15 signaling software 96 is used for conducting the RTP and RTCP protocol with the IP 
endpoints and for conducting the signaling for establishing the IP link over the PSTN 
network. The SRTP encryption/decryption software 98 encrypts IP media packets when a 
destination gateway is not configured with EE-SRTP 22. 

The PSTN encryption and decryption software 102 is used for encrypting and 

20 decrypting the media when the encrypted SRPT packets cannot be transported "as is" over the 
PSTN 24. The dial plan 38 contains source and destination phone numbers and information 
associated with these phone numbers. For example, source phone numbers that are suppose 
to use SRTP may have a SRTP flag. If a destination phone number requires transport over a 
PSTN network, a PSTN flag may be associated with that destination phone number. IP 

25 addresses are also associated with different phone numbers in the dial plan 38. 

Different Coder/Decoders (codecs) 100 can be used according to the scheme used for 
transporting the IP media over the PSTN network. For example, the IP link 25 established 
over the PSTN 24 may have a bandwidth of 64 thousand bits per second (kbps). Some 
amount of that bandwidth will be used for transporting SRTP packets headers. However, the 

30 headers may not be transported when the SRTP packets are converted into a conventional 
PSTN call. Thus, a first codec 100 with increased data compression might be used when 
encrypted SRTP packets are sent over IP link 25. A second codec 101 with less data 
compression may be used for conventional PSTN transport over the PSTN network. 

A simplified RTP header compression technique, based on RFC2508, can be utilized 

35 to allow larger payloads to still fit within 64kbps. In addition, Multilink PPP (MLPPP) can 
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be used to bind multiple DSOs channels together for one VoIP call, and would allow for 
higher bandwidth voice or video applications. 

The system described above can use dedicated processor systems, micro controllers, 
programmable logic devices, or microprocessors that perform some or all of the operations. 
Some of the operations described above may be implemented in software and other 
operations may be implemented in hardware. 

For the sake of convenience, the operations are described as various interconnected 
functional blocks or distinct software modules. This is not necessary, however, and there 
may be cases where these functional blocks or modules are equivalently aggregated into a 
single logic device, program or operation with unclear boundaries. In any event, the 
functional blocks and software modules or features of the flexible interface can be 
implemented by themselves, or in combination with other operations in either hardware or 
software. 

Having described and illustrated the principles of the invention in a preferred 
embodiment thereof, it should be apparent that the invention may be modified in arrangement 
and detail without departing from such principles. I claim all modifications and variation 
coming within the spirit and scope of the following claims. 
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